Measure based metrics for aggregated data

نویسنده

  • Victor J. Rayward-Smith
چکیده

Aggregated data arises commonly from surveys and censuses where groups of individuals are studied as coherent entities. The aggregated data can take many forms including sets, intervals, distributions and histograms. The data analyst needs to measure the similarity between such aggregated data items and a range of metrics are reported in the literature to achieve this (e.g. the Jaccard metric for sets and the Wasserstein metric for histograms). In this paper, a unifying theory based on measure theory is developed that establishes not only that known metrics are essentially similar but also suggests new metrics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Uncertainty Measurement for Ultrasonic Sensor Fusion Using Generalized Aggregated Uncertainty Measure 1

In this paper, target differentiation based on pattern of data which are obtained by a set of two ultrasonic sensors is considered. A neural network based target classifier is applied to these data to categorize the data of each sensor. Then the results are fused together by Dempster–Shafer theory (DST) and Dezert–Smarandache theory (DSmT) to make final decision. The Generalized Aggregated Unce...

متن کامل

Generalized Aggregate Uncertainty Measure 2 for Uncertainty Evaluation of a Dezert-Smarandache Theory based Localization Problem

In this paper, Generalized Aggregated Uncertainty measure 2 (GAU2), as a newuncertainty measure, is considered to evaluate uncertainty in a localization problem in which cameras’images are used. The theory that is applied to a hierarchical structure for a decision making to combinecameras’ images is Dezert-Smarandache theory. To evaluate decisions, an analysis of uncertainty isexecuted at every...

متن کامل

Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines

Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...

متن کامل

A generalized approach for producing, quantifying, and validating citizen science data from wildlife images

Citizen science has the potential to expand the scope and scale of research in ecology and conservation, but many professional researchers remain skeptical of data produced by nonexperts. We devised an approach for producing accurate, reliable data from untrained, nonexpert volunteers. On the citizen science website www.snapshotserengeti.org, more than 28,000 volunteers classified 1.51 million ...

متن کامل

Pattern detection in null model analysis

Null model analysis has been a popular tool for detecting pattern in binary presence–absence matrices, and previous tests have identified algorithms and metrics that have good statistical properties. However, the behavior of different metrics is often correlated, making it difficult to distinguish different patterns. We compared the performance of a suite of null models and metrics that have be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Intell. Data Anal.

دوره 15  شماره 

صفحات  -

تاریخ انتشار 2011